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Abstract. Krawtchouk polynomials appear in a variety of contexts, 
most notably as orthogonal polynomials and in coding theory via the 
Krawtchouk transform. We present an operator calculus formulation of 
the Krawtchouk transform that is suitable for computer implementa- 
tion. A positivity result for the Krawtchouk transform is shown. Then 
our approach is compared with the use of the Krawtchouk transform in 
coding theory where it appears in Mac Williams' and Delsarte's theorems 
t/3 , on weight enumerators. We conclude with a construction of Krawtchouk 

^, ' polynomials in an arbitrary finite number of variables, orthogonal with 

respect to the multinomial distribution. 
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Krawtchouk polynomials appear originally as orthogonal polynomials for the 
[_^ \ binomial distribution [4,10], and in coding theory via the Krawtchouk trans- 

form in the context of Mac Williams' theorem on weight enumerators as well in 
Delsarte's extension to association schemes [5,8]. They play a role in discrete 
formulations of quantum mechanics [2,6,7,9], transforms in optics [1], as well 
as in recent developments in image analysis [11]. 



Q.^ ' 1 Introduction 



O 



M , We present an operator calculus formulation of the Krawtchouk transform 

that not only is theoretically elucidating, it is highly suitable for computer im- 
plementation. A consequence of our formulation is a positivity theorem for the 
Krawtchouk transform of polynomials. We indicate connections with the trans- 
form appearing in coding theory. 

This work appeared in: 

Serge Autexier, Jacques Calmet, David Delahaye, Patrick D. F. Ion, Laurence Rideau, 
Renaud Rioboo, and Alan P. Sexton, editors. Intelligent Computer Mathematics, 10th 
International Conference, AISC 2010, 17th Symposium, Calculemus 2010, and 9th In- 
ternational Conference, MKM 2010, Paris, France, July 5-10, 2010. Proceedings, vol- 
ume 6167 of Lecture Notes in Computer Science. Springer, 2010. 
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2 Krawtchouk polynomials as a canonical Appell system 

2.1 Generating function 

Consider a Bernoulli random walk starting at the origin, jumping to the left with 
probability q, to the right with probability p, p + q = 1, pq ^ 0. After N steps, 
the position is x, with j ~ {N — x)/2 denoting the number of jumps to the left. 

Start with the generating function 

Giv) = (1 + 2qv)^^+^'>^\l - 2pi;)(^-")/2 

W n 

= (1 + 2qvf-i{l - 2pvy - V ^i^„ 

^ — ' n! 

n=0 

where we consider K^ as a function of x or of j according to context. 

2.2 Orthogonality 

We check orthogonality with respect to the binomial distribution, j running from 

to N, with corresponding probabilities I , jq^p^^^. Using angle brackets to 

denote expected value, we wish to show that {G{v)G{w)) is a function of the 
product vw. 

"XI ( ■ )q^P^^^ {^ + Mv + w) + 4q^vw)^-^ {1 - 2p{v + w) + Ap^vwy 

^ {p + 2pq{v + w) + Apq vw + q — 2pq{v + w) + 4p qvw) 
= (1 + Apqvw)^ 

which immediately gives the squared norms |[iir„||^ = (n!)^(^j(4|7g)". 

2.3 Canonical Appell system 

For an Appell system with generating function exp[2;x — iiJ(z)], a corresponding 
canonical Appell system has a generating function of the form 

exp[xC/(u) - tH{U{v))] 

where U{v) is analytic about the origin in C, with analytic inverse V{z)^ and 
H{z) is the logarithm of the Fourier-Laplace transform of the distribution of x 
at time 1. Here we have N replacing i, and write 

G{v) = {l + 2{q-p)v- Apqv^l^ (t3^' ^'' 
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identifying 

U{v) = I log -^-^ , and H{z) ^ log(pe" + qe-') 
2 i — 2pv 

One checks that 



logbe^(^) + ,e-^(") ) = P^\^ + <l\ll^ = {l + 2{q-p)v- Apqv'r^'^ 
which verifies the form ex.Y'[xU{v) — NH{U{v))]. 



Solving z — U{v) — U{V{z)), we find 

(e^ — e^^)/2) sinhz 



V{z)^ 

pe^ + qe~^ pe^ + qe~^ 

3 Krawtchouk expansions 

The generating function, with z — U{v), is 

G{v) = e— ^«(-) = V ^i^„(x,iV) 

n>0 

Rearrange to get, 

e^- = {pe^ + qe~^r T ( ""^' X ^^^^ 
^ ^ ^ /l^ Vpe^ + ge-^y n! 

We want to write the coefficients of the expansion in terms of I? = d/dx 
acting on a function of x. We cannot substitute z <r^ D directly, since the D and 
X do not commute. Introduce another variable s. Replacing z by Dg ~ d/ds, 
apply both sides to a function /(s): 



^^D. 



z — / \ pe^= + (je ^= / n! 



n>0 



Now we move the operators involving Dg past Kn{x, N). Letting s = 0, thinking 
of / as a function of x instead of s, we can replace Dg by our usual D = d/dx, 
to get 

-■^-^ n V ne-^ + qe -^ / 

0<n<Ar ^^ ^ ^ 

In other words, the coefficients of the Krawtchouk expansion of fix) are given 

by 

f{n) = -, (pe^ + ge-^)^-"(sinhD)"/(0) (1) 
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Theorem 1. For p > q, if a polynomial has positive coefficients, then the co- 
efficients of its Krawtchouk expansion are positive. For p = q = 1/2, we have 
nonnegativity of the Krawtchouk coefficients. 

Proof. Note that if iIj{D) is any formal power series ^ CnD"^ in D, then 



i){D)3 



^ m ' 



m\Cm 



is positive if Cm is. Now, write pe^ + qe^^ — cosh z + {p— q) sinh z. li p > q, then 
the power series expansion has positive coefficients so that the coefficients in the 
Krawtchouk expansion are positive as well. For p — q, we have nonnegativity. 

3.1 Matrix formulation 

As shown in [3], for Krawtchouk polynomials of degree at most N, one can use 
the matrix of D, denoted D, acting on the standard basis {1, x, x^, . . . , x^} re- 
placing the operator D in eq. (1). 



For example, take iV = 4. Wc have 

/O 1 0\ 

00200 
£)= 00030 

00004 
\0 0/ 

The basic matrices needed are computed directly from the power series for the 
exponential function. Thus, 

/I2p-1 1 2p-l 1 \ 



pe 



qe 



-D 






1 


4p- 


2 


3 8p-4 








1 




Qp-'i 6 













1 8p-4 


lo 










1 



where the relation p + q — 1 has been used and 

/01010\ 
00204 
sinh £) = 3 
00004 

yoooooy 

The nilpotence of D reduces the exponentials to polynomials in D, making 
computations with these matrices convenient and fast. 
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3.2 Functions of two or more variables 

For functions on the plane, on an {N + l)-by-(M + 1) grid, we can use tire 
product basis Kn{x,N) Km{y,M). The formula for the expansion coefficients 
has the product form, denoting Di = d/dx, D2 = d/dy, 

f{n,m) = -1- (pie^i+gie-^i)^-"(p2e^^+92e-^^)^-"(sinhi?i)"(sinhi?2)"/(0,0) 
n\ to! 

where we note the relations pi + qi = P2 + Q2 = 1- The basis functions are 
orthogonal with respect to the corresponding product of the marginal binomial 
distributions, i.e. the underlying random variables are independent. 

A similar construction works for 3 or more variables. In §5, we present an 
extension to multinomial distributions. 

4 Krawtchouk polynomials in coding theory 

The Krawtchouk transform appears in coding theory in a different variant as an 
essential component of Mac Williams' theorem on weight enumerators [5,8]. It 
appears in Delsarte's formulation in terms of association schemes as well. 

Fix A^ > 0. For Krawtchouk transforms on functions defined on {— A^, 2 — 
N, . . . ,N ~ 2, A^} wc use the Fourier-Krawtchouk matrices, <P, which we call 
"Kravchuk matrices" . The entries are the values of the Krawtchouk polynomials 
as functions on {—N, . . . , N}. Thus, via the mapping x = N — 2j,0<j<N, 
the columns correspond to values of x and we write 

G{v) == (1 + 2qv)^-^{l - 2pvy = Yl v'$,j 

i 

In [5] , the Krawtchouk polynomials are defined via a slightly different gener- 
ating function, changing the notation to fit our context, 

G,(«) = (1 + (s - l)t;)^--'(l - vy =J2v'Mj;N,s) 

i 

The difference is primarily one of scaling. Comparing G with Gg , replacing v by 
v/{2p), we find 

K,{j;N,s)^{2pr'<I>,, 

with s-l = i = i:iP = i-l, or 

p p p ' 

1 

s = — 
P 

The condition s > 2 thus corresponds to p < q, complementary to the condition 
required for positivity in the previous section. 

Following [5, p. 132], we have: 
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— Mac Williams' Theorem: If A is a linear code over Wg and B = A-^, its 
dual, then the weight distribution of B is, up to a factor, the Krawtchouk 
transform of the weight distribution of A. 

— Delsarte's Theorem: If A is a code over an alphabet of size s, then the values 
of the Krawtchouk transform of the coefficients of the distance enumerator 
are nonnegative. 

In this context, the components of the Krawtchouk transform of a vector v 
are defined by 

Vi = ^Ki{j;N,s)vj 

3 

We show how to invert the transform. 

4.1 Inverse transform 

The calculation of §2.2 can be recast in matrix form. Set B equal to the diagonal 
matrix 

B = diag(p^,iVp^-ig, ...,{^^^ p^-g^ . . . ,q^) 

Let r denote the diagonal matrix of squared norms, here without the factors of 
n!, 



r = diag (1, N{4pq), • • • , ( 7 ) (4m)\ • • • , (4^)"^) 
With G(v) = ^v^<Pij, we have, following the calculation of §2.2, 

= {G{v)G{w)) = Y^i^^yr,, 

i 

In other words, the orthogonality relation takes the form 



<PB$'^ = r 



from which the inverse is immediate 

<p-^ = B<P'^r-^ 

A consequence of this formula is that det ^ is independent of p and q: 

(det<?')2 = ^^= 2^(^+1) 
^ ' det B 

The sign can be checked for the symmetric case p = q = 1/2 with the result 

det<l' = ±2^(^+i)/2 
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with the + sign for A^ = 0, 3 (mod 4). 
We illustrate with an example. 
Example. For iV = 4, 

/ 1 1 1 

8(7 6(7 - 2p 4(7-47? 

24 (7^ 12 (7^ -UpqAq^ - Wpq + Ap'^ -12 pq+ 12 p'^ 24 p^ 
32^38(73-2475(72 -Wpq'^ + lGp'^q 24752,7-8733 -32p3 
\l6q* -1675^3 16752^2 -16p3g 16p4 / 

where we keep p and q to show the symmetry. 



$ 



1 1 \ 

2q — 6p — 8p 



/ 1675 



2^ B<P^r-^ 



•^ 8753 4752 2p 1 \ 

6475358752(3^-7)) Sp{q-p) 2q-6p -4 



96752^2 2Aqp{q — 75) ^q^ — l(Spq + ip"^ -6(7 + 675 6 
6475^3 8g2(g-3p) -8g((7-75) 69-275 -4 
\ 16 q^ -8g3 4g2 -2q 1 J 

satisfies Q<P ^ <PQ = 2* /. 



4.2 Modified involution property 

For p ~ 1/2, there is a simpler approach to inversion. Namely the identity 

<Z>2 = 2^ / 

We will see how this is modified for p ^ 1/2, which will result in a very simple 
form for ^~^. 

Proposition 1. Let P be the diagonal matrix 

P = diag((2p)",...,(2p)^-^...,1) 

Let P' be the diagonal matrix 

P'-diag(l,...,(2pF,...,(2pn 

Then 

<PP<P = 2^P' 

We just sketch the proof as it is similar to that for orthogonality. 
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Proof. The matrix equation is the same as the corresponding identity via gen- 
erating functions. Namely, 

First, sum over i, using the generating function G{v), with j replaced by k. Then 
sum over fc, again using the generating function. Finally, summing over j using 
the binomial theorem yields the desired result, via p + q = 1. 

Thus, 

Corollary 1. 



1-1 



^_i ^ 2~^ P<PP 



5 Krawtchouk polynomials in 2 or more variables 

Now we will show a general construction of Krawtchouk polynomials in variables 
Oil J2j • • • J Jd): running from to A^ according to the level N. These systems are 
analogous to wavelets in that they have a dimension, d, and a resolution N. 



5.1 Symmetric representation of a matrix 

Given a d x d matrix A, we will find the "symmetric representation" of A, the 
action on the symmetric tensor algebra of the underlying vector space. This is 
effectively the action of the matrix A on vectors extended to polynomials in d 
variables. 

Introduce commuting variables xi, . . . ,Xd- Map 



= E^ 



Vi — /_, ^ij ^j 



3 



We use multi-indices, m = mi, . . . , md, rrii > 0, similarly for n. Then a monomial 






and similarly for y". The induced map at level N has matrix elements A^ 
determined by the expansion 



"-E^« 



y 



One can order the matrix entries lexicographically corresponding to the mono- 
mials x™. We call this the induced matrix at level N. Often the level is called 
the degree since the induced matrix maps monomials of homogeneous degree N 
to polynomials of homogeneous degree N. 
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We introduce the special matrix B which is a diagonal matrix with multino- 
mial coefficients as entries. 



^nm — ^nm 



n J Till 722! • • • rirf! 



For simplicity, for B we will not explicitly denote the level or dimension, as it 
must be consistent with the context. 

The main feature of the map A — > A is that at each level it is a multiplicative 
homomorphism, that is, 

A1A2 = Ai A2 

as follows by applying the definition to yi — '^iAiA2)ijXj first to Ai, then to 
^2- Observe, then, that the identity map is preserved and that inverses map to 
inverses. However, transposes require a separate treatment. 

Transposed matrix. The basic lemma is the relation between the induced 
matrix of A with that of its transpose. We denote the transpose of A, e. g. , by 

Lemma 1. The induced matrices at each level satisfy 

W=B-^A'^B 
Proof. Start with the bilinear form F — y XiAiji/j. Then, from the definition 
oiA, 

n,m ^ ^ 

Now write F = y. ^« (^ )i* Vj with 



n.ni ^ ^ 

Matching the above expressions, we have, switching indices appropriately, 
(A^B) -A (^]-(^]w -(bW) 

\^^ ^-* )mn ^^nm 1 I — 1 I ^^ 'ran V /ran 

which is the required relation. 

5.2 General construction of orthogonal polynomials with respect to 
a multinomial distribution 

For the remainder of the article, we work in d + 1 dimensions, with the first 
coordinate subscripted with 0. 
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The multinomial distribution extends the binomial distribution to a sequence 
of independent random variables where one of d choices occurs at each step, 
choice i occurring with probability pi. The probability of none of the d choices 
is Pa — 1 — pi — p2 ~ ■•■ ^ Pd- The probabilities for a multinomial distribution 
at step iVare given by 

p(ii,i2, . . . , jd) = ( . . ^ . j^oVi' ---Pd 

\JO,Jl,j2,---,JdJ 

this being the joint probability distribution that after N trials, choice i has 
occurred ji times, with none of them occurring jo ^ N — ji — ■ ■ ■ — jd times. Let 
P denote the diagonal matrix with diagonal {po,pi, . . . ,Pd)- Then with 

yi — / ^ ^ij'^j — Pi^i 
3 

we see that y" — p"x" which implies that, at each level N, P is diagonal with 
entries 

f> r n N — ni Ud ni Ud 

^nm — OnmP — Pq P\ ' ' ' P^ 

In other words, the multinomial distribution comprises the entries of the diagonal 
matrix BP. 

Now for the construction. Start with an orthogonal matrix W, a diagonal 
matrix of probabilities P, and a diagonal matrix D with positive entries on the 
diagonal. Let 

^ = p-i/2^£,i/2 (2) 

where the power on a diagonal matrix is applied entrywise. Then it is readily 
checked that 

A^PA = D 

which will generate the squared norms of the polynomials, to be seen shortly. 
Using the homomorphism property, we have, via the Lemma, 

Wpa = b-^a^bPA = D 

Or, setting <P = A^ , 

'PBP'P'^ = BD (3) 

with both BP and BD diagonal. 

Taking as variables the column indices, writing j for m, we have the n*"^ 
Krawtchouk polynomial 

Kn{j;N,p)^^nj 

at level N. The relation given by equation (3) is the statement that 

the Krawtchouk polynomials are orthogonal with respect to the multinomial 
distribution, BP, with squared norms given by the entries of the induced matrix 
BD. 
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To summarize, starting with the matrix A as in equation (2), form the in- 
duced matrix at level N. Then the corresponding Krawtchouk polynomials are 
functions of the column labels of the transpose of the induced matrix. Apart 
from the labelling conventions, then, in fact they are polynomials in the row 
labels of the original induced matrix. 

Finally, note that the basic case arises from the choice 



A = 



1 1 
1 -1 



for d = 1 with p=l-p = 1/2, D = I. 

Remark 1. A useful way to get a symmetric orthogonal matrix W is to start 
with any vector, u, form the rank-one projection, V ~ vv^ /v^v and take for W 
the corresponding reflection 2V — I. 

5.3 Examples 

Two variables. For two variables, start with the 3x3 matrices 



A = 



(I 1 l\ 

1 -1 
Vl 1-2/ 



P = 



/1/3 \ 

1/2 
\ l/6y 



<p(2) 



and D the identity. We find for the level two induced matrix 

/I 1 1 1 1 l\ 

2 2-202 

2 1-1 0-2-4 

1-1 1 1-1 1 

2-1-1 2-4 

\1 0-2 4/ 

indicating the level explicitly. These are the values of the polynomials evaluated 
at integer values of the variables (ji, j2, • • •)• ^"^ multiplying a data vector on 
either side will give the corresponding Krawtchouk transform of that vector. 



Three variables. This example is very close to the basic case for d = 1. 

Start with the vector v^ = (1,-1,— 1,-1). Form the corresponding rank-one 

projection and the associated refiection, as indicated in the remark above. We 

find 

/I 1 1 1\ 



A 



1 1-1-1 
1-1 1-1 

yi-i-i ly 
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<^(2) 



and take the uniform distributfon pi = 1/4, and D = I. We find 

/I 1 1 1 1 1 1 1 1 l\ 
2 2 2 0-2-2-2 
2 2 0-2 0-2 2 0-2 
2 2-2-2 0-2 2 

1 1-1-1 1-1-1 1 1 1 

2 0-2-2 2 0-2 2 
2 0-2 0-2 2 2 0-2 
1-1 1-1 1-1 1 1-1 1 
2-2 2 0-2 2-2 

\1-1-1 1 1 1-1 1-1 1/ 

With A^ = AI, we have, in addition to the orthogonahty relation, as in equation 
(3), that (<l>(2))2 = 16/. 



6 Conclusion 

Using matrix methods allows for a clear formulation of the properties of Krawtchouk 
polynomials and Krawtchouk transforms. Working with polynomials or with 
vectors, computations can be done very efficiently. We have shown how to con- 
struct Krawtchouk polynomials in an arbitrary (finite) number of variables, with 
enough flexibility in the parameters to allow for a wide range of potential appli- 
cations. 



7 Appendix 

Here is maple code for producing the symmetric powers of a matrix. The argu- 
ments are the matrix X and the level N, denoted dg in the code, for "degree". 

SYMPOWER := proc(X, dg) 

local nd, ND, XX, x, y, strt, i, vv, yy, j, ww,kx,kk; 

nd := (linalg: -rowdim) (X) ; 

ND := (combinat : -binomial) (nd + dg - 1, dg) ; 

XX := matrix (ND, ND, 0); 

X := vector (nd) ; 

y := (linalg: -multiply) (X, x) ; 

strt := (combinat : -binomial) (nd + dg - 1, dg - 1) - 1; 

for i to ND do vv := (combinat : -inttovec) (strt + i, nd) ; 
yy := product (y[kk] ~vv[kk] , kk = 1 .. nd) ; 
for j to ND do WW := (combinat : -inttovec) (strt + j, nd) ; 
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XX[i, j] := coeftayKyy, 

[seq(x[kx], kx = 1 .. nd)] = [seq(0, kx = 1 .. nd)] , ww) ; 
end do ; 
end do ; 
evalm(XX) ; 
end: "outputs the symmetric power of X"; 
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